3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
<Not Specified>
Size:
747 KByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:FooTweets: A Bilingual Parallel Corpus of World Cup Tweets
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Henny Sluyter-Gäthje | University of Hamburg | DE | ||
| Author 2 | Pintu Lohar | Dublin City University | IE | ||
| Author 3 | Haithem Afli | ADAPT Centre | IE | ||
| Author 4 | Andy Way | ADAPT, Dublin City University | IE | CNGL, Dublin City University | IE |
| Main Contact | Pintu Lohar | Dublin City University | None |
Documentation:
https://github.com/HAfli/FooTweets_Corpus/blob/master/README.md
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
free for research use
Size:
2.15 MByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Incorporating Label Dependency for Answer Quality Tagging in Community Question Answering via CNN-LSTM-CRF
-
Paper track:Information Retrieval, Information Extraction, Question Answering
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Yang Xiang | Harbin Institute of Technology Shenzhen Graduate School | CN |
| Author 2 | Xiaoqiang Zhou | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Author 3 | Qingcai Chen | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Author 4 | Zhihui Zheng | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Author 5 | Buzhou Tang | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Author 6 | Xiaolong Wang | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Author 7 | Yang Qin | Harbin Institute of Technology Shenzhen Graduate School | N/A |
| Main Contact | Yang Xiang | Harbin Institute of Technology Shenzhen Graduate School | None |
Documentation:
publicly available documentation written in englishLanguage Type:
Trilingual
Languages:
Czech English Mandarin Chinese
Availability:
From Data Center(s)
License:
LDC
Size:
eight million tokens Production Status:
Newly created-finished
Use:
Semantic Role Labeling
-
Paper title:Towards Comparability of Linguistic Graph Banks for Semantic Parsing
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Stephan Oepen | Universitetet i Oslo | NO |
| Author 2 | Marco Kuhlmann | Linköping University | SE |
| Author 3 | Yusuke Miyao | National Instutite of Informatics | JP |
| Author 4 | Daniel Zeman | Charles University in Prague, Faculty of Mathematics and Physics | CZ |
| Author 5 | Silvie Cinkova | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 6 | Dan Flickinger | Stanford | US |
| Author 7 | Jan Hajic | Charles University in Prague | CZ |
| Author 8 | Angelina Ivanova | University of Oslo | NO |
| Author 9 | Zdenka Uresova | Charles University in Prague | CZ |
| Main Contact | Stephan Oepen | Universitetet i Oslo | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
Apache 2.0
Size:
2 MByte Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Hashtag Occurrences, Layout and Translation: A Corpus-driven Analysis of Tweets Published by the Canadian Government
-
Paper track:Multimodality
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Fabrizio Gotti | RALI | CA |
| Author 2 | Phillippe Langlais | Université de Montréal | CA |
| Author 3 | Atefeh (Anna) Farzindar | NLP Technologies | US |
| Main Contact | Fabrizio Gotti | RALI | None |
Documentation:
<Not Specified>
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
GPL v3
Size:
<Not Specified> <Not Specified>Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Enhanced English Universal Dependencies: An Improved Representation for Natural Language Understanding Tasks
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Sebastian Schuster | Stanford University | US |
| Author 2 | Christopher D. Manning | Stanford University | US |
| Main Contact | Sebastian Schuster | Stanford University | None |
Documentation:
http://nlp.stanford.edu/software/lex-parser.shtml
Speech
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
CC BY 4.0
Size:
1000 hours Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Ali Can Kocabiyikoglu | University of Grenoble Alpes | FR |
| Author 2 | Laurent Besacier | LIG | FR |
| Author 3 | Olivier Kraif | University of Grenoble Alpes | FR |
| Main Contact | Ali Can Kocabiyikoglu | University of Grenoble Alpes | None |
Documentation:
'LibriSpeech: an ASR corpus based on public domain audio books'', Vassil Panayotov, Guoguo Chen, Daniel Povey and Sanjeev Khudanpur, ICASSP 2015'Language Type:
Multilingual
Languages:
English french
Availability:
Not Available
License:
In progress
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas
-
Paper track:Multimodality
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Sashi Novitasari | Bandung Institute of Technology | ID | ||
| Author 2 | Quoc Truong Do | Nara Institute of Science and Technology | JP | ||
| Author 3 | Sakriani Sakti | Nara Institute of Science and Technology | JP | ||
| Author 4 | Dessi Lestari | Bandung Institute of Technology | ID | ||
| Author 5 | Satoshi Nakamura | Nara Institute of Science and Technology | JP | ||
| Main Contact | Sakriani Sakti | Nara Institute of Science and Technology | None | Nara Institute of Science and Technology (NAIST) / RIKEN AIP | None |
Documentation:
Not AvailableLanguage Type:
Multilingual
Languages:
English
Availability:
Will be available as soon as possible, hopefully before the conference date
License:
CC BY-NC-ND 3.0
Size:
118 hours Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:TED-LIUM: an Automatic Speech Recognition dedicated corpus
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Anthony Rousseau | Université du Maine, LIUM | None | ||
| Author 2 | Paul Deléglise | LIUM | FR | Université du Maine, LIUM | None |
| Author 3 | Yannick Estève | LIUM | FR | Université du Maine, LIUM | None |
| Main Contact | Anthony Rousseau | LIUM | FR |
Documentation:
<Not Specified>Language Type:
Trilingual
Languages:
English Japanese Japanese Sign Language
Availability:
Not Available
License:
<Not Specified>
Size:
1000 sentences Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Deep JSLC: A Multimodal Corpus Collection for Data-driven Generation of Japanese Sign Language Expressions
-
Paper track:Multimodality
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Heike Brock | Honda Research Institute | JP |
| Author 2 | Kazuhiro Nakadai | Honda Research Institute | JP |
| Main Contact | Heike Brock | Honda Research Institute | None |
Documentation:
Documentation in English and Japanese for internal use
Written
semantic concordance (lexicon interlinked with a corpus annotation),
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
Creative Commons BY-NC-SA
Size:
30 <Not Specified>Production Status:
Newly created-finished
Use:
Word Sense Disambiguation
-
Paper title:A database of semantic clusters of verb usages
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Silvie Cinková | Charles University in Prague | None |
| Author 2 | Martin Holub | Charles University in Prague | None |
| Author 3 | Adam Rambousek | Masarykova Univerzita, Brno | None |
| Author 4 | Lenka Smejkalová | Charles University in Prague | None |
| Main Contact | Silvie Cinkova | Charles University in Prague | CZ |
Documentation:
http://ufal.mff.cuni.cz/spr, English, publicly available




